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DETAILED ACTION 

Response to Amendment 

1 . In response to the Office Action mailed 6/7/10, applicant has submitted an 
amendment filed 7/20/10. 

Claims 1, 3-7, 9-18, have been amended. Claim 8 has been cancelled. 
Response to Arguments 
1 . Applicant's arguments with respect to claims 1 and 3-18 have been considered 
but are moot in view of the new ground(s) of rejection. 

Applicant argues that Li does not teach different kinds of sources for the terms 
(Amendment, page 18), but as claimed this limitation does not necessarily require that 
the word-terms and word-classes are from separate and distinct sources. The claims 
only require that the combination be generated and that it includes a set of word-terms 
and a set of word-classes. A generator that generates a combination of terms from a 
SINGLE source that includes BOTH term word information and word class information 
would still read on the claims. 

Claim Rejections - 35 USC § 103 

1 . The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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2. Claims 1, 3, 5-7, 9-18, are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Li et al. ("Improving Latent Semantic indexing Based Classifier With 
Information Gain"), hereafter Li, in view of Diab et al.("An Unsupervised Method for 
Word Sense Tagging using Parallel Corpora"), hereafter Diab. 

As per Claim 1, 15, Li teaches a method (and corresponding apparatus, where 
the joint classifier is defined in the claim by its function identical to the method in Claim 
1) comprising: receiving, by a processor-based device, a communication that comprises 
a word that is a natural-language word ("natural language understanding... directing the 
user's call... matches a user's request", Section 4, Experimental Setup, paragraph 1; 
where users communicate by speaking in natural language which includes speaking 
natural language words) 

selecting by the processor-based device a plurality of terms wherein the selecting 
is based on an information-gain value of each term ("term-document matrix... each 
selected term... IG based term selection is implemented... terms are selected and used 
in the term-document matrix based on their discriminative power", Section 3; where the 
terms selected are IG based and sorted by their individual values) 

generating by the processor-based device a matrix, wherein (i) the matrix 
comprises a plurality of categories and a plurality of terms, and (ii) each term in the 
matrix is associated with at least one category ("term-document matrix M is formed by 
terms... selected term is mapped to a unique row vector and each category is mapped 
to a unique column vector", Section 3, especially paragraph 3; where the matrix is 
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formed by combining terms [i.e. word terms] and categories [word-classes], and in a 
matrix, the matrix cell corresponding to a specific row's term/word-term and a specific 
column's category/word-class associates the term and the category corresponding to a 
cell; Alternatively, words are naturally associated with some particular class [e.g. words 
that are verbs, medical words, English words, etc.] and "at least one category" as 
claimed does not necessarily refer to any category in particular so any word naturally 
reads on this claim limitation because words are, by virtue of what they are, part of 
some form of category) 

determining from the matrix, by the processor-based device, a category for the 
word ("LSI classifier... categorize an unknown document... derived from... as in LSI 
according to IG enhanced term-document matrix... similarity... n-best categories", 
Section 3; "user's request", Section 4; where categorizing a document by consequence 
categorizes the word in that document, and this is "joint classification" in the sense that 
it "jointly" uses both term information and category information to perform classification 
[i.e. "joint classifier is configured to determine at least one category for the words, by 
applying a combination of word information and word class information to the words", 
Specification, page 6]). 

Li fails to teach generating by the processor-based device a combination of 
terms, based on the word, comprising: (i) a set of word-terms and (ii) a set of word- 
classes, wherein a term is one of a word-term and a word-class, where the plurality of 
terms selected is from the combination of terms, where the terms assigned information- 
gain values are the combination of terms 
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Diab teaches/suggests generating by the processor-based device a combination 
of terms, based on the word, wherein a term is one of a word-term and a word-class 
comprising: (i) a set of word-terms and (ii) a set of word-classes, wherein a term is one 
of a word-term and a word-class ("word sense tagging... automatically sense 
annotating... large amounts of data... using an unsupervised algorithm... bootstrap... 
creating a sense-tagged corpus", Introduction, especially paragraph 3; "project the 
sense tags from the target side to the source side... KIND-OF-DRAMA sense... 
CALAMITY... the tagging... would yield... large number of French words will receive 
tags from the English sense inventory", Approach, especially 4th bullet, paragraph 
ending at the upper right of page 257, and last paragraph; Applicant does not claim that 
the word that the generation is based on was derived from the communication, so as 
long as the generated data includes the word that also exists in the communication, it is 
"based on the word" that the communication "comprises"; Diab teaches sense-tagging 
words to create a corpus of sense-tagged data. Each of these sense-tagged words 
includes a word-class like CALAMITY and a word-term like catastrophe. Diab teaches 
the existence of different senses [like KIND-OF-DRAMA] and it is at least obvious that 
catastrophe is not the only word with different senses in French. The sense-annotated 
words in the corpus, collectively, are a "combination of terms" because the 
classes/senses are combined with their corresponding words, and at the very least in 
set theory sets can include 1 element, or alternatively, collectively all of the sense- 
annotated words are a set of word-terms and their corresponding senses collectively 
are a set of word-classes) 
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where the plurality of terms selected is from the combination of terms, where the 
terms assigned information-gain values are the combination of terms ("word sense 
tagging... automatically sense annotating... large amounts of data... using an 
unsupervised algorithm... bootstrap... creating a sense-tagged corpus", Introduction, 
especially paragraph 3; "project the sense tags from the target side to the source side... 
KIND-OF-DRAMA sense... CALAMITY... the tagging... would yield... large number of 
French words will receive tags from the English sense inventory", Approach, especially 
4th bullet, paragraph ending at the upper right of page 257, and last paragraph; where 
Li [in Section 3, especially paragraph 2] teaches that the term-document matrix is 
generated from a labeled corpus, though it does not specifically state how the corpus is 
generated. Diab teaches generating a labeled corpus by a processor which contains 
the very information that Li wishes to extract for Li's matrix. Therefore, one of ordinary 
skill in the art can simply substitute the corpus used in Li with one generated from a 
processor as per Diab that contains the information needed to generate the matrix, and 
which can be used to bootstrap the classifier described in Li) 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of invention to perform a simple substitution of one corpus which the information- 
gain-matrix performs selection from with another, because Li teaches a classification 
method/device which differed from the claimed device by the substitution of a 
corpus containing word class information and word term information generated by some 
means used by Li to generated a matrix, with another corpus containing the same 
information derived/generated by a processor. Diab teaches that a corpus generated by 
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a processor and containing word class information and word term information was 
known in the art. One of ordinary skill in the art could have substituted one 
corpus for another to obtain the predictable results of a system which performs 
classification using a matrix generated from a corpus containing terms and classes (as 
per Li) where the corpus containing terms and classes is generated by a processor (as 
per Diab). 

As per Claim 3, 16, 18, Li teaches/suggests (along with its apparatus equivalent 
of Claim 1 6 and article of manufacture equivalent in Claim 1 8, where claim 1 8 includes 
the limitations of both Claims 1 and 3 and so incorporates the rejections presented 
above regarding claim 1 as well) routing the communication by the processor-based 
device to a particular one of a plurality of destination terminals of a communication 
system, wherein the routing is based on the category of the word, and wherein the 
communication system comprises the processor-based device and the plurality of 
destination terminals ("routing... appropriate destination within a call center", Section 4; 
where the routing to destinations in the call-center/system based on the query's 
categorization, which is done via the joint classifier, which includes categorizing the 
words in the query, and the "system" can be interpreted as the call router and all of the 
places that the call is routed to; and this is "joint classification" in the sense that it 
"jointly" uses both term information and category information to perform classification 
[i.e. "joint classifier is configured to determine at least one category for the words, by 
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applying a combination of word information and word class information to the words", 
Specification, page 6]). 

As per Claim 5, Li teaches selecting of the plurality of terms is further based on a 
percentile value applied to the respective information-gain values of each term in the 
combination of terms ("top p percentile... according to the IG score", Section 3; where 
the terms being in the combination of terms is addressed in the same manner as above 
by Diab in the parent claim, the set selected which is part of the corpus they are 
selected from can be interpreted as the combination of terms as well). 

As per Claim 6, Li teaches wherein the information-gain value for each term in 
the combination of terms, indicates the average entropy variations over a plurality of 
possible categories for each term in the combination of terms ("significance of the term 
based on the entropy variations of the categories, which relates to the perplexity of the 
classification task", Section 2; "literal terms... may not match those of a relevant 
document", Section 1, paragraph 1; "IG enhanced... classified... categorize an unknown 
document", Section 3; where the entropy variations are taught by Li to relate to 
perplexity and so an entropy calculation is also a perplexity calculation and Equation 1 
describes the information gain value being calculated from entropy/perplexity. Also the 
subscript ti at the end of Section 2 at least suggests that there is more than one term for 
which the information gain is calculated; where the terms being in the combination of 
terms is addressed above by Diab in the parent claim, the set selected which is part of 
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the corpus they are selected from can be interpreted as the combination of terms as 
well). 

As per Claim 7, 1 7, Li teaches (along with its apparatus equivalent of Claim 1 7) 
wherein the category of the word is a cell in the term-category matrix ("cell... j-th 
category", Section 3). 

As per Claim 9, Li fails to teach wherein the combination of terms is generated by 
interleaving individual word-terms with their corresponding word-classes. 

Diab teaches/suggests wherein the combination of terms is generated by 
interleaving individual word-terms with their corresponding word-classes ("word sense 
tagging... automatically sense annotating... large amounts of data... using an 
unsupervised algorithm... bootstrap... creating a sense-tagged corpus", Introduction, 
especially paragraph 3; "project the sense tags from the target side to the source side... 
KIND-OF-DRAMA sense... CALAMITY... the tagging... would yield... large number of 
French words will receive tags from the English sense inventory", Approach, especially 
4th bullet, paragraph ending at the upper right of page 257, and last paragraph; where 
word-sense tagging interleaves [mixes or inserts the sense tags regularly between 
words in the corpus] the sense/word-classes and their respective words/word-terms) 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of invention to perform a simple substitution of one corpus which the information- 
gain-matrix performs selection from with another, because Li teaches a classification 
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method/device which differed from the claimed device by the substitution of a 

corpus containing word class information and word term information generated by some 
means used by Li to generated a matrix, with another corpus containing the same 
information derived/generated by a processor. Diab teaches that a corpus generated by 
a processor and containing word class information and word term information was 
known in the art. One of ordinary skill in the art could have substituted one 
corpus for another to obtain the predictable results of a system which performs 
classification using a matrix generated from a corpus containing terms and classes (as 
per Li) where the corpus containing terms and classes is generated by a processor (as 
per Diab). 

As per Claim 10, Li teaches/suggests a method comprising: receiving, by a 
processor-based device, a communication that comprises at least one word, wherein 
each of the at least one word that is a natural-language word ("natural language 
understanding... directing the user's call... matches a user's request", Section 4, 
Experimental Setup, paragraph 1 ; where users communicate by speaking in natural 
language which includes speaking natural language words) 

selecting by the processor-based device a plurality of terms wherein the selecting 
is based on an information-gain value of each term ("term-document matrix... each 
selected term... IG based term selection is implemented... terms are selected and used 
in the term-document matrix based on their discriminative power", Section 3; where the 
terms selected are IG based and sorted by their individual values) 
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generating by the processor-based device a term-category matrix, wherein (i) the 
term-category matrix comprises a plurality of terms and a plurality of categories, and (ii) 
each term in the matrix is associated with at least one category ("term-document matrix 
M is formed by terms... selected term is mapped to a unique row vector and each 
category is mapped to a unique column vector", Section 3, especially paragraph 3; 
where the matrix is formed by combining terms [i.e. word terms] and categories [word- 
classes], and in a matrix, the matrix cell corresponding to a specific row's term/word- 
term and a specific column's category/word-class associates the term and the category 
corresponding to a cell; Alternatively, words are naturally associated with some 
particular class [e.g. words that are verbs, medical words, English words, etc.] and "at 
least one category" as claimed does not necessarily refer to any category in particular 
so any word naturally reads on this claim limitation because words are, by virtue of what 
they are, part of some form of category) 

classifying the communication by utilizing a joint classifier upon the at least one 
word, wherein the joint classifier comprises the term-category matrix ("LSI classifier... 
categorize an unknown document... derived from... as in LSI according to IG enhanced 
term-document matrix... similarity... n-best categories", Section 3; "user's request", 
Section 4; where categorizing a document by consequence categorizes the word in that 
document, and this is "joint classification" in the sense that it "jointly" uses both term 
information and category information to perform classification [i.e. "joint classifier is 
configured to determine at least one category for the words, by applying a combination 
of word information and word class information to the words", Specification, page 6]). 
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Li fails to teach generating by the processor-based device a combination of 
terms, based on the word, comprising: (i) a set of word-terms and (ii) a set of word- 
classes, wherein a term is one of a word-term and a word-class, where the plurality of 
terms selected is from the combination of terms, where the terms assigned information- 
gain values are the combination of terms 

Diab teaches/suggests generating by the processor-based device a combination 
of terms, based on the word, wherein a term is one of a word-term and a word-class 
comprising: (i) a set of word-terms and (ii) a set of word-classes, wherein a term is one 
of a word-term and a word-class ("word sense tagging... automatically sense 
annotating... large amounts of data... using an unsupervised algorithm... bootstrap... 
creating a sense-tagged corpus", Introduction, especially paragraph 3; "project the 
sense tags from the target side to the source side... KIND-OF-DRAMA sense... 
CALAMITY... the tagging... would yield... large number of French words will receive 
tags from the English sense inventory", Approach, especially 4th bullet, paragraph 
ending at the upper right of page 257, and last paragraph; Applicant does not claim that 
the word that the generation is based on was derived from the communication, so as 
long as the generated data includes the word that also exists in the communication, it is 
"based on the word" that the communication "comprises"; Diab teaches sense-tagging 
words to create a corpus of sense-tagged data. Each of these sense-tagged words 
includes a word-class like CALAMITY and a word-term like catastrophe. Diab teaches 
the existence of different senses [like KIND-OF-DRAMA] and it is at least obvious that 
catastrophe is not the only word with different senses in French. The sense-annotated 
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words in the corpus, collectively, are a "combination of terms" because the 
classes/senses are combined with their corresponding words, and at the very least in 
set theory sets can include 1 element, or alternatively, collectively all of the sense- 
annotated words are a set of word-terms and their corresponding senses collectively 
are a set of word-classes) 

where the plurality of terms selected is from the combination of terms, where the 
terms assigned information-gain values are the combination of terms ("word sense 
tagging... automatically sense annotating... large amounts of data... using an 
unsupervised algorithm... bootstrap... creating a sense-tagged corpus", Introduction, 
especially paragraph 3; "project the sense tags from the target side to the source side... 
KIND-OF-DRAMA sense... CALAMITY... the tagging... would yield... large number of 
French words will receive tags from the English sense inventory", Approach, especially 
4th bullet, paragraph ending at the upper right of page 257, and last paragraph; where 
Li [in Section 3, especially paragraph 2] teaches that the term-document matrix is 
generated from a labeled corpus, though it does not specifically state how the corpus is 
generated. Diab teaches generating a labeled corpus by a processor which contains 
the very information that Li wishes to extract for Li's matrix. Therefore, one of ordinary 
skill in the art can simply substitute the corpus used in Li with one generated from a 
processor as per Diab that contains the information needed to generate the matrix, and 
which can be used to bootstrap the classifier described in Li) 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of invention to perform a simple substitution of one corpus which the information- 
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gain-matrix performs selection from with another, because Li teaches a classification 
method/device which differed from the claimed device by the substitution of a 
corpus containing word class information and word term information generated by some 
means used by Li to generated a matrix, with another corpus containing the same 
information derived/generated by a processor. Diab teaches that a corpus generated by 
a processor and containing word class information and word term information was 
known in the art. One of ordinary skill in the art could have substituted one 
corpus for another to obtain the predictable results of a system which performs 
classification using a matrix generated from a corpus containing terms and classes (as 
per Li) where the corpus containing terms and classes is generated by a processor (as 
per Diab). 

As per Claim 1 1 , Li teaches wherein a cell l,j, of the term-category matrix 
represents a classification by the processor-based device of an i-th selected term into a 
j-th category ("LSI classifier... categorize an unknown document... similarity... n-best 
categories", Section 3; "user's request", Section 4; where categories in the matrix are 
among the j categories and categorizing a request including terms categorizes it into a 
category among the categories numbered by the values of j. Cells of matrices 
correspond to a particular row and common and in the case of a cell corresponding to a 
word/row and class/column, the cell represents an association between a word and the 
corresponding class) 
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As per Claim 12, Li teaches/suggests a method comprising: receiving, by a 
processor-based device, a communication that comprises a word that is a natural- 
language word ("natural language understanding... directing the user's call... matches a 
user's request", Section 4, Experimental Setup, paragraph 1; where users communicate 
by speaking in natural language which includes speaking natural language words) 

selecting by the processor-based device a plurality of terms wherein the selecting 
is based on an information-gain value of each term ("term-document matrix... each 
selected term... IG based term selection is implemented... terms are selected and used 
in the term-document matrix based on their discriminative power", Section 3; where the 
terms selected are IG based and sorted by their individual values) 

wherein the selecting comprises: i) calculating an information gain value for each 
term that corresponds to the word ("terms are selected and used... according to IG 
criterion... sort the terms", Section 3; "terms in documents", Section 1, paragraph 1; 
where sorting terms by their IG values means that each term had its IG value calculated 
such that they can be sorted, and the terms are in documents that communicate 
information [received at the input to the classification system], and documents contain 
words and so the terms in this context are words) 

ii) sorting the terms in the union of terms in a descending order of information 
gain values ("sort the terms by their IG values in descending order", Section 3) 

iii) setting a threshold of an information gain value corresponding to a specified 
percentile ("select top p percentile of terms according to the IG score distribution", 
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Section 3; where taking the top p percentile sets the lowest of that p percentile as the 
threshold IG score) 

iv) selecting only the terms having an information gain value greater than or 
equal to the threshold to generate the plurality of terms ("select top p percentile of 
terms", Section 3; where taking the top p percentile takes all terms exceeding the lowest 
IG value in that percentile and excludes everything falling below the percentile). 

Li fails to teach generating by the processor-based device a combination of 
terms, based on the word, comprising: (i) a set of word-terms and (ii) a set of word- 
classes, wherein a term is one of a word-term and a word-class, where the plurality of 
terms selected is from the combination of terms, where the terms assigned information- 
gain values are the combination of terms, where the plurality of terms selected is from 
the combination of terms, where the terms assigned information-gain values are the 
combination of terms, and where the terms are from the combination of terms 

Diab teaches/suggests generating by the processor-based device a combination 
of terms, based on the word, wherein a term is one of a word-term and a word-class 
comprising: (i) a set of word-terms and (ii) a set of word-classes, wherein a term is one 
of a word-term and a word-class ("word sense tagging... automatically sense 
annotating... large amounts of data... using an unsupervised algorithm... bootstrap... 
creating a sense-tagged corpus", Introduction, especially paragraph 3; "project the 
sense tags from the target side to the source side... KIND-OF-DRAMA sense... 
CALAMITY... the tagging... would yield... large number of French words will receive 
tags from the English sense inventory", Approach, especially 4th bullet, paragraph 
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ending at the upper right of page 257, and last paragraph; Applicant does not claim that 
the word that the generation is based on was derived from the communication, so as 
long as the generated data includes the word that also exists in the communication, it is 
"based on the word" that the communication "comprises"; Diab teaches sense-tagging 
words to create a corpus of sense-tagged data. Each of these sense-tagged words 
includes a word-class like CALAMITY and a word-term like catastrophe. Diab teaches 
the existence of different senses [like KIND-OF-DRAMA] and it is at least obvious that 
catastrophe is not the only word with different senses in French. The sense-annotated 
words in the corpus, collectively, are a "combination of terms" because the 
classes/senses are combined with their corresponding words, and at the very least in 
set theory sets can include 1 element, or alternatively, collectively all of the sense- 
annotated words are a set of word-terms and their corresponding senses collectively 
are a set of word-classes) 

where the plurality of terms selected is from the combination of terms, where the 
terms assigned information-gain values are the combination of terms, and where the 
terms are from the combination of terms ("word sense tagging... automatically sense 
annotating... large amounts of data... using an unsupervised algorithm... bootstrap... 
creating a sense-tagged corpus", Introduction, especially paragraph 3; "project the 
sense tags from the target side to the source side... KIND-OF-DRAMA sense... 
CALAMITY... the tagging... would yield... large number of French words will receive 
tags from the English sense inventory", Approach, especially 4th bullet, paragraph 
ending at the upper right of page 257, and last paragraph; where Li [in Section 3, 
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especially paragraph 2] teaches that the term-document matrix is generated from a 
labeled corpus, though it does not specifically state how the corpus is generated. Diab 
teaches generating a labeled corpus by a processor which contains the very information 
that Li wishes to extract for Li's matrix. Therefore, one of ordinary skill in the art can 
simply substitute the corpus used in Li with one generated from a processor as per Diab 
that contains the information needed to generate the matrix, and which can be used to 
bootstrap the classifier described in Li) 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of invention to perform a simple substitution of one corpus which the information- 
gain-matrix performs selection from with another, because Li teaches a classification 
method/device which differed from the claimed device by the substitution of a 
corpus containing word class information and word term information generated by some 
means used by Li to generated a matrix, with another corpus containing the same 
information derived/generated by a processor. Diab teaches that a corpus generated by 
a processor and containing word class information and word term information was 
known in the art. One of ordinary skill in the art could have substituted one 
corpus for another to obtain the predictable results of a system which performs 
classification using a matrix generated from a corpus containing terms and classes (as 
per Li) where the corpus containing terms and classes is generated by a processor (as 
per Diab). 
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As per Claim 13, Li teaches wherein the selected terms in the plurality of terms 
are processed by the processor-based device to form a term-category matrix from 
which a joint classifier determines at least one category for the word, and wherein the 
processor-based device comprises the joint classifier ("LSI classifier... categorize an 
unknown document... derived from... as in LSI according to IG enhanced term- 
document matrix... similarity... n-best categories", Section 3; "user's request", Section 
4; where categorizing a document by consequence categorizes the word in that 
document, and this is "joint classification" in the sense that it "jointly" uses both term 
information and category information to perform classification [i.e. "joint classifier is 
configured to determine at least one category for the words, by applying a combination 
of word information and word class information to the words", Specification, page 6]). 

As per Claim 14, Li teaches generating by the processor-based device a term- 
category matrix, wherein (i) the term-category matrix comprises a plurality of terms and 
a plurality of categories, and (ii) each term in the matrix is associated with at least one 
category ("term-document matrix M is formed by terms... selected term is mapped to a 
unique row vector and each category is mapped to a unique column vector", Section 3, 
especially paragraph 3; where the matrix is formed by combining terms [i.e. word terms] 
and categories [word-classes], and in a matrix, the matrix cell corresponding to a 
specific row's term/word-term and a specific column's category/word-class associates 
the term and the category corresponding to a cell; Alternatively, words are naturally 
associated with some particular class [e.g. words that are verbs, medical words, English 
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words, etc.] and "at least one category" as claimed does not necessarily refer to any 
category in particular so any word naturally reads on this claim limitation because words 
are, by virtue of what they are, part of some form of category) 

selecting from the term-category matrix, by the processor-based device, a 
category for the word ("LSI classifier... categorize an unknown document... derived 
from... as in LSI according to IG enhanced term-document matrix... similarity... n-best 
categories", Section 3; "user's request", Section 4; where categorizing a document by 
consequence categorizes the word in that document, and this is "joint classification" in 
the sense that it "jointly" uses both term information and category information to perform 
classification [i.e. "joint classifier is configured to determine at least one category for the 
words, by applying a combination of word information and word class information to the 
words", Specification, page 6]) 

routing the communication by the processor-based device to a particular one of a 
plurality of destination terminals of a communication system, wherein the routing is 
based on the category of the word, and wherein the communication system comprises 
the processor-based device and the plurality of destination terminals ("routing... 
appropriate destination within a call center", Section 4; where the routing to destinations 
in the call-center/system based on the query's categorization, which is done via the joint 
classifier, which includes categorizing the words in the query, and the "system" can be 
interpreted as the call router and all of the places that the call is routed to). 
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1 . Claim 4 is rejected under 35 U.S.C. 103(a) as being unpatentable over Li, in view 
of Diab, as applied to Claim 1 , above, and further in view of Sakai et al. (US 7,099,81 9), 
hereafter Sakai. 

As per Claim 4, Li, in view of Diab, fail to teach wherein an automatic word class 
clustering algorithm is utilized to generate the word-classes. 

Sakai teaches wherein an automatic word class clustering algorithm is utilized to 
generate the word-classes ("category decision rules... each text is classified to a 
category according to the category decision rule", col. 3, lines 35-50; "automatically 
creates a new category", col. 6, line 53 - col. 7, line 5; "if a cluster consisting of a large 
number of texts... new category to which this cluster is classified", col. 6, lines 34-40; 
"cluster generation unit", col. 6, lines 7-24; where the clustering is automatically 
performed and whose results is used for a new word class, and so it is an automatic 
word class clustering algorithm and is used to generate new word class [i.e., category] 
rules/information. Li and Diab teach where categories/senses are generated somehow 
[since they must have been derived from somewhere to be used in tagging], without 
providing the specifics. Sakai teaches another method for generating the same data 
and so a simple substitution of the generation can be performed to yield the class/sense 
information used in Diab's sense tagging ). 

Therefore, it would have been obvious to one of ordinary skill in the art to perform 
a simple substitution of one category with another, because Li and Diab teach a 
device which differed from the claimed device by the substitution of categories 
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generated by some unspecified manner with categories generated by clustering. Sakai 
teaches that categories generated by clustering were known in the art. One of 
ordinary skill in the art could have substituted one known element for another by 

using categories generated from clustering instead in the unsupervised tagging in Diab 
in order to obtain the predictable results of a system that performs classification 
based on a matrix derived from word class and word term data (Li) where the word 
class and word term data are automatically generated by a processor (Diab) based on 
classes determined initially using a form of clustering (Sakai). 

Conclusion 

2. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. See PTO-892. 

3. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See M PEP 

§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
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extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to ERIC YEN whose telephone number is (571)272-4249. 
The examiner can normally be reached on M-F 7:30-4:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on 571-272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

EY 7/29/10 

/Eric Yen/ 

Examiner, Art Unit 2626 



